skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Zhao, Xinyu"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Free, publicly-accessible full text available July 1, 2026
  2. Long-context LLMs are increasingly in demand for applications such as retrieval-augmented generation. To defray the cost of pretraining LLMs over long contexts, recent work takes an approach of synthetic context extension: fine-tuning LLMs with synthetically generated long-context data in a post-training stage. However, it remains unclear how and why this synthetic context extension imparts abilities for downstream long-context tasks. In this paper, we investigate fine-tuning on synthetic data for three long-context tasks that require retrieval and reasoning. We vary the realism of "needle" concepts to be retrieved and diversity of the surrounding "haystack" context, from using LLMs to construct synthetic documents to using templated relations and creating symbolic datasets. We find that models trained on synthetic data fall short of the real data, but surprisingly, the mismatch can be interpreted and even predicted in terms of a special set of attention heads that are responsible for retrieval over long context, retrieval heads (Wu et al., 2024). The retrieval heads learned on synthetic data have high overlap with retrieval heads learned on real data, and there is a strong correlation between the recall of heads learned and the downstream performance of a model. Furthermore, with attention knockout and activation patching, we mechanistically show that retrieval heads are necessary and explain model performance, although they are not totally sufficient. Our results shed light on how to interpret synthetic data fine-tuning performance and how to approach creating better data for learning real-world capabilities over long contexts. 
    more » « less
    Free, publicly-accessible full text available May 27, 2026
  3. Free, publicly-accessible full text available May 1, 2026
  4. Abstract ObjectiveExtracting social determinants of health (SDoHs) from medical notes depends heavily on labor-intensive annotations, which are typically task-specific, hampering reusability and limiting sharing. Here, we introduce SDoH-GPT, a novel framework leveraging few-shot learning large language models (LLMs) to automate the extraction of SDoH from unstructured text, aiming to improve both efficiency and generalizability. Materials and MethodsSDoH-GPT is a framework including the few-shot learning LLM methods to extract the SDoH from medical notes and the XGBoost classifiers which continue to classify SDoH using the annotations generated by the few-shot learning LLM methods as training datasets. The unique combination of the few-shot learning LLM methods with XGBoost utilizes the strength of LLMs as great few shot learners and the efficiency of XGBoost when the training dataset is sufficient. Therefore, SDoH-GPT can extract SDoH without relying on extensive medical annotations or costly human intervention. ResultsOur approach achieved tenfold and twentyfold reductions in time and cost, respectively, and superior consistency with human annotators measured by Cohen's kappa of up to 0.92. The innovative combination of LLM and XGBoost can ensure high accuracy and computational efficiency while consistently maintaining 0.90+ AUROC scores. DiscussionThis study has verified SDoH-GPT on three datasets and highlights the potential of leveraging LLM and XGBoost to revolutionize medical note classification, demonstrating its capability to achieve highly accurate classifications with significantly reduced time and cost. ConclusionThe key contribution of this study is the integration of LLM with XGBoost, which enables cost-effective and high quality annotations of SDoH. This research sets the stage for SDoH can be more accessible, scalable, and impactful in driving future healthcare solutions. 
    more » « less
    Free, publicly-accessible full text available June 10, 2026
  5. Functional Magnetic Resonance Image (fMRI) is commonly employed to study human brain activity, since it offers insight into the relationship between functional fluctuations and human behavior. To enhance analysis and comprehension of brain activity, Graph Neural Networks (GNNs) have been widely applied to the analysis of functional connectivities (FC) derived from fMRI data, due to their ability to capture the synergistic interactions among brain regions. However, in the human brain, performing complex tasks typically involves the activation of certain pathways, which could be represented as paths across graphs. As such, conventional GNNs struggle to learn from these pathways due to the long-range dependencies of multiple pathways. To address these challenges, we introduce a novel framework BrainMAP to learn multiple pathways in brain networks. BrainMAP leverages sequential models to identify long-range correlations among sequentialized brain regions and incorporates an aggregation module based on Mixture of Experts (MoE) to learn from multiple pathways. Our comprehensive experiments highlight BrainMAP's superior performance. Furthermore, our framework enables explanatory analyses of crucial brain regions involved in tasks. 
    more » « less
    Free, publicly-accessible full text available April 11, 2026
  6. We consider Euler flows on two-dimensional (2-D) periodic domain and are interested in the stability, both linear and nonlinear, of a simple equilibrium given by the 2-D Taylor–Green vortex. As the first main result, numerical evidence is provided for the fact that such flows possess unstable eigenvalues embedded in the band of the essential spectrum of the linearized operator. However, the unstable eigenfunction is discontinuous at the hyperbolic stagnation points of the base flow and its regularity is consistent with the prediction of Lin (Intl Math. Res. Not., vol. 2004, issue 41, 2004, pp. 2147–2178). This eigenfunction gives rise to an exponential transient growth with the rate given by the real part of the eigenvalue followed by passage to a nonlinear instability. As the second main result, we illustrate a fundamentally different, non-modal, growth mechanism involving a continuous family of uncorrelated functions, instead of an eigenfunction of the linearized operator. Constructed by solving a suitable partial differential equation (PDE) optimization problem, the resulting flows saturate the known estimates on the growth of the semigroup related to the essential spectrum of the linearized Euler operator as the numerical resolution is refined. These findings are contrasted with the results of earlier studies of a similar problem conducted in a slightly viscous setting where only the modal growth of instabilities was observed. This highlights the special stability properties of equilibria in inviscid flows. 
    more » « less
    Free, publicly-accessible full text available November 25, 2025
  7. Free, publicly-accessible full text available January 1, 2026
  8. We present a method of detecting bifurcations by locating zeros of a signed version of the smallest singular value of the Jacobian. This enables the use of quadratically convergent root-bracketing techniques or Chebyshev interpolation to locate bifurcation points. Only positive singular values have to be computed, though the method relies on the existence of an analytic or smooth singular value decomposition (SVD). The sign of the determinant of the Jacobian, computed as part of the bidiagonal reduction in the SVD algorithm, eliminates slope discontinuities at the zeros of the smallest singular value. We use the method to search for spatially quasi-periodic traveling water waves that bifurcate from large-amplitude periodic waves. The water wave equations are formulated in a conformal mapping framework to facilitate the computation of the quasi-periodic Dirichlet-Neumann operator. We find examples of pure gravity waves with zero surface tension and overhanging gravity-capillary waves. In both cases, the waves have two spatial quasi-periods whose ratio is irrational. We follow the secondary branches via numerical continuation beyond the realm of linearization about solutions on the primary branch to obtain traveling water waves that extend over the real line with no two crests or troughs of exactly the same shape. The pure gravity wave problem is of relevance to ocean waves, where capillary effects can be neglected. Such waves can only exist through secondary bifurcation as they do not persist to zero amplitude. The gravity-capillary wave problem demonstrates the effectiveness of using the signed smallest singular value as a test function for multi-parameter bifurcation problems. This test function becomes mesh independent once the mesh is fine enough. 
    more » « less
  9. We present a numerical study of spatially quasi-periodic gravity-capillary waves of finite depth in both the initial value problem and travelling wave settings. We adopt a quasi-periodic conformal mapping formulation of the Euler equations, where one-dimensional quasi-periodic functions are represented by periodic functions on a higher-dimensional torus. We compute the time evolution of free surface waves in the presence of a background flow and a quasi-periodic bottom boundary and observe the formation of quasi-periodic patterns on the free surface. Two types of quasi-periodic travelling waves are computed: small-amplitude waves bifurcating from the zero-amplitude solution and larger-amplitude waves bifurcating from finite-amplitude periodic travelling waves. We derive weakly nonlinear approximations of the first type and investigate the associated small-divisor problem. We find that waves of the second type exhibit striking nonlinear behaviour, e.g. the peaks and troughs are shifted non-periodically from the corresponding periodic waves due to the activation of quasi-periodic modes. 
    more » « less